Measurement Study of the Web Through a Spam Lens
نویسنده
چکیده
Spam messages are capable of carrying links to disconnected portions of the Internet. This paper looks the web as it is visible through URLs embedded in spam. We perform a study of spam using three sources: a spam honeypot, a group of high-spam student inboxes and a newsgroup devoted to posting spam messages. Our results show that 96% of spam links point to sites not reacheable by crawlers and that most of these sites are not reviewed for safety by security companies. We show that some of these contain sophisticated exploits and argue that security companies need to include URLs found in spam in their analysis.
منابع مشابه
The Symbiosis of Human and Semantic Technology Through the Lens of Actor-Network Theory
Background: Semantic technologies (STs) have made machine reasoning possible by providing intelligent data management methods. This capability has created new forms of interaction between humans and STs, which is called "semantic interaction." The increasing spread of this form of interaction in daily life reveals the need to identify the factors affecting it and introduce the requirements of...
متن کاملAnalysis of Web Spam for Non-English Content: Toward More Effective Language-Based Classifiers
Web spammers aim to obtain higher ranks for their web pages by including spam contents that deceive search engines in order to include their pages in search results even when they are not related to the search terms. Search engines continue to develop new web spam detection mechanisms, but spammers also aim to improve their tools to evade detection. In this study, we first explore the effect of...
متن کاملSociological Impact of Using Digital (Web-based) Analyses on Performance Measurement and Optimization of Digital Marketing among Young Managers (Case study: Digital-based Companies in Tehran)
This research aims to study the effect of using digital (web-based) analyses in performance measurement and optimization of digital marketing in digital-based companies in Tehran. The data collection tool was a researcher-made questionnaire. A panel of experts and supervisor were asked to measure the validity of the questionnaire. For reliability analysis of this tool, Cronbach’s alpha test was...
متن کاملMeasurement of the Adsorbed Radiation Dose to Eyelens During CT Scan and Radiotherapy of Nasopharynx Cancer
Introduction: The study of the cause of death during the last few decades has shown that death due to infectious diseases has been declining and has been rising due to noninvasive diseases, especially cancers and accidents. Cancer is considered as one of the fatal diseases, and every year, more than 10.9 million people worldwide are diagnosed with the disease. Mater...
متن کاملLink-Based Characterization and Detection of Web Spam
We perform a statistical analysis of a large collection of Web pages, focusing on spam detection. We study several metrics such as degree correlations, number of neighbors, rank propagation through links, TrustRank and others to build several automatic web spam classifiers. This paper presents a study of the performance of each of these classifiers alone, as well as their combined performance. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007